A Phonological Modeling System Based on Autosegmental and Articulatory Phonology
Authors
Abstract
This paper describes the design and implementation of a phonological modeling system based on autosegmental and articulatory phonology, and its application in speech recognition. Pronunciation modeling is an integral part of speech recognition systems. Together with language modeling, signal processing and learning models (e.g. hidden Markov models and neural networks), it influences the performance of a speech recognition system. The basic units adopted by most speech recognition systems for word pronunciation modeling are linearly ordered diphones or triphones (i.e. context-dependent segments without internal structure). Our approach decomposes segments into articulatory features and allows these features to spread freely into neighbouring segments to model continuous speech. We use a feature-based finite-state transducer as the metalanguage for writing phonological rules. A set of rules has been implemented in this language. The rules are written according to theories of autosegmental and articulatory phonology. The numerical components of the rules are deduced from articulator trajectory data. The output of the system consists of gestural scores that model phonological alternations in continuous speech. The gestural scores are used to generate hidden Markov models for building speech recognition systems.
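The idea of decomposing segments into features that spread into neighbouring segments can be illustrated with a small sketch. This is not the system described in the abstract: the feature inventory (`SEGMENT_FEATURES`), the `Gesture` record, the functions `gestural_score` and `spread_left`, the uniform segment duration, and the overlap value are all hypothetical choices made for illustration. The example models anticipatory nasalization, where the open-velum gesture of a nasal starts early and overlaps the preceding vowel.

```python
from dataclasses import dataclass

# Hypothetical toy feature inventory (not the paper's feature set).
SEGMENT_FEATURES = {
    "k": {"velum": "closed", "voicing": "off"},
    "ae": {"velum": "closed", "voicing": "on"},
    "n": {"velum": "open", "voicing": "on"},
}


@dataclass
class Gesture:
    tier: str      # articulatory tier, e.g. "velum"
    value: str     # feature value on that tier
    start: float   # onset time (arbitrary units)
    end: float     # offset time (arbitrary units)


def gestural_score(segments, duration=1.0):
    """Build one gesture per feature per segment on a uniform timeline."""
    score = []
    for i, seg in enumerate(segments):
        for tier, value in SEGMENT_FEATURES[seg].items():
            score.append(Gesture(tier, value, i * duration, (i + 1) * duration))
    return score


def spread_left(score, tier, value, overlap):
    """Return a copy of the score in which every gesture with the given
    tier and value starts `overlap` units earlier, shortening the
    differing gesture that precedes it on the same tier."""
    out = [Gesture(g.tier, g.value, g.start, g.end) for g in score]
    same_tier = sorted((g for g in out if g.tier == tier), key=lambda g: g.start)
    for prev, cur in zip(same_tier, same_tier[1:]):
        if cur.value == value and prev.value != value:
            # Never spread further than the neighbour is long.
            shift = min(overlap, prev.end - prev.start)
            prev.end -= shift
            cur.start -= shift
    return out


if __name__ == "__main__":
    base = gestural_score(["k", "ae", "n"])
    nasalized = spread_left(base, tier="velum", value="open", overlap=0.25)
    for g in nasalized:
        print(g)
```

In this sketch the overlap is a hand-picked constant. The abstract states that the numerical components of the actual rules are deduced from articulator trajectory data, so a fixed value like this one only stands in for such a measurement.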
Similar resources
One-Level Phonology: Autosegmental Representations and Rules as Finite Automata
When phonological rules are regarded as declarative descriptions, it is possible to construct a model of phonology in which rules and representations are no longer distinguished and such procedural devices as rule-ordering are absent. In this paper we present a finite-state model of phonology in which automata are the descriptions and tapes (or strings) are the objects being described. This pro...
Phonological Events
One of the major innovations within post-SPE generative phonology has been the development of frameworks where phonological units are organised in a non-linear fashion. Taking autosegmental phonology (Goldsmith 1976) as our main exemplar of such frameworks, we wish to address the following question: What is the appropriate interpretation of autosegmental representations? There is, of course, a ...
Segmental anchoring of pitch movements: Autosegmental association or gestural coordination?
Arvaniti, Ladd and Mennen (1998) reported a phenomenon of ‘segmental anchoring’: the beginning and end of a linguistically significant pitch movement are anchored to specific locations in segmental structure, which means that the slope and duration of the pitch movement vary according to the segmental material with which it is associated. This finding has since been replicated and extended in s...
Optimality Theory and African Language Phonology
ACAL, held at the University of Illinois in 1989. At that time, the dominant research paradigm was autosegmental phonology, a theory which is concerned with issues in the representation of distinctive features (Goldsmith 1976; Leben 2006). Work on African language phonology was so central to the development of autosegmental phonology, that Goldsmith (1990) in his presentation felt no need to de...
The Intonational Phonology of Catalan
This chapter presents an analysis of the prosodic and intonational structure of Catalan within the Autosegmental-Metrical (AM) framework (Pierrehumbert 1980, Pierrehumbert and Beckman 1988, Ladd 1996, Gussenhoven 2004, Jun 2005, and Beckman et al. 2005, among others). Based on this analysis, we have developed the Cat_ToBI system of prosodic annotation of Catalan corpora (Prieto, Aguilar, Mascar...
Publication date: 2007